Reinforcement learning - PDFSEARCH.IO - Document Search Engine

Reinforcement learning
Results: 1147

#	Item
131	Temporal Difference Learning to Detect Unsafe System States Huazhong Ning∗, Wei Xu†, Yue Zhou∗, Yihong Gong†, Thomas Huang∗ ∗ ECE Department, U. of Illinois at Urbana-Champaign, Urbana, IL 61801. {hning2, Source URL: www.ifp.illinois.edu Language: English - Date: 2008-08-06 16:36:02 Applied mathematics Artificial intelligence XT Learning Reinforcement learning Artificial neural network Machine learning Supervised learning
132	German Journal on Artificial Intelligence (KI), Springer, to appear. Online Learning of Bipedal Walking Stabilization Source URL: www.ais.uni-bonn.de Language: English - Date: 2015-06-26 14:32:15 Robot control Humanoid robot Mobile robot Robotics Zero moment point Bipedalism Walking Reinforcement learning Artificial intelligence Biota Learning
133	Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains Source URL: www.intelligence.tuc.gr Language: English - Date: 2012-04-19 16:26:14 Statistics Statistical theory Probability Bayesian statistics Dynamic programming Markov processes Stochastic control Markov decision process Reinforcement learning Q-learning Prior probability Conjugate prior
134	Sequential Decision Making with Untrustworthy Service Providers W. T. Luke Teacy, Georgios Chalkiadakis, Alex Rogers and Nicholas R. Jennings Electronics and Computer Science, University of Southampton Southampton, SO17 1 Source URL: www.intelligence.tuc.gr Language: English - Date: 2008-02-08 15:13:13 Cognitive science Artificial intelligence Cognition Philosophy Game theory Computational trust Computer access control Key management Software agent Reinforcement learning Practical reason Trust
135	Around Inverse Reinforcement Learning and Score-based Classification Matthieu Geist IMS - MaLIS Research Group (Supélec) Metz, France Source URL: www.metz.supelec.fr Language: English - Date: 2014-01-18 03:53:53 Machine learning Artificial intelligence Learning Applied mathematics Computational neuroscience Dynamic programming Stochastic control Reinforcement learning Apprenticeship learning Markov decision process Artificial neural network Supervised learning
136	Increasing the Action Gap: New Operators for Reinforcement Learning Marc G. Bellemare and Georg Ostrovski and Arthur Guez Philip S. Thomas∗ and Rémi Munos Google DeepMind {bellemare,ostrovski,aguez,munos}@google.com; Source URL: psthomas.com Language: English - Date: 2015-12-12 00:05:18 Mathematics Mathematical optimization Dynamic programming Mathematical analysis Equations Operations research Systems theory Stochastic control Bellman equation Markov decision process Q-learning Reinforcement learning
137	Evolutionary Feature Evaluation for Online Reinforcement Learning Source URL: eldar.mathstat.uoguelph.ca Language: English - Date: 2016-07-12 12:05:04 Cognitive science Cognition Artificial intelligence Machine learning Belief revision Reinforcement learning Temporal difference learning Q-learning Feature selection Supervised learning Proto-value functions Action selection
138	Enhancing Agent Safety through Autonomous Environment Adaptation Benjamin Rosman Bradley Hayes Source URL: bradhayes.info Language: English - Date: 2016-07-11 15:51:27 Belief revision Reinforcement learning Caregiver Robotics Human development Psychoanalysis Learning Personal life Behavior
139	Trust Region Policy Optimization arXiv:1502.05477v4 [cs.LG] 6 Jun 2016 John Schulman JOSCHU@EECS.BERKELEY.EDU Source URL: arxiv.org Language: English - Date: 2016-06-06 20:48:19 Numerical analysis Applied mathematics Statistics Markov models Mathematical optimization Operations research Reinforcement learning Expectation-maximization algorithm Sine Proximal gradient method Gradient method Loss function
140	Advances in Theoretical Economics Volume , Issue   Article  Source URL: people.bu.edu Language: English - Date: 2006-03-02 12:30:18 Markov models Behaviorism Markov processes Psychology Addiction Behavior therapy Reinforcement Markov chain Behavior Learning
